Adapting Transformer Networks for Document Summarization and Sentiment Analysis

نویسندگان

چکیده

Due to the increasing number of text-based content sources, demand for effective sentiment analysis and document summarization techniques has been increasing. Several transformer-based models, including “ELECTRA, BERT, XLNet, RoBERTa, DistilBERT, ALBERT” have emerged as promising alternatives traditional methods. This paper aims study effectiveness different transformer models performing on Yelp dataset. The analyze various models' performance tasks, identify their weaknesses, suggest possible improvements. It also thoroughly studies dataset, which over 5 million reviews. introduces that are used We then perform evaluation these using metrics measure performance. Some include ROUGE, F1-score, AUC-ROC, accuracy. According paper's experimental results, RoBERTa BERT better than other when it comes summarization. In addition, we identified weaknesses strengths each model. implementing domain-specific training fine-tuning improve results experiment revealed ones found strengths, contributes literature related use in documents by providing an extensive suggests modifications capabilities.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Distraction-Based Neural Networks for Document Summarization

Distributed representation learned with neural networks has recently shown to be effective in modeling natural languages at fine granularities such as words, phrases, and even sentences. Whether and how such an approach can be extended to help model larger spans of text, e.g., documents, is intriguing, and further investigation would still be desirable. This paper aims to enhance neural network...

متن کامل

Document Embeddings for Arabic Sentiment Analysis

Research and industry are more and more focusing in finding automatically the polarity of an opinion regarding a specific subject or entity. Paragraph vector has been recently proposed to learn embeddings which are leveraged for English sentiment analysis. This paper focuses on Arabic sentiment analysis and investigates the use of paragraph vector within a machine learning techniques to determi...

متن کامل

Automatic Analysis of Document Sentiment

Sentiment analysis, which deals with the computational treatment of opinion, sentiment , and subjectivity in text, has attracted a great deal of attention. Potential applications include question-answering systems that address opinions as opposed to facts and business intelligence systems that analyze user feedback. The research issues raised by such applications are often quite challenging com...

متن کامل

Summarization and Sentiment Analysis for Understanding Socially-generated Content

ive Summarization. Generating abstracts for a set of relevant documents is a more challenging task than all the techniques mentioned above can handle, because it requires both techniques for language understanding and generation. Many factors need to be considered for building a complete abstract generation system, even for a specific domain or application. In addition to informativeness and no...

متن کامل

Decoupling Encoder and Decoder Networks for Abstractive Document Summarization

Abstractive document summarization seeks to automatically generate a summary for a document, based on some abstract “understanding” of the original document. State-of-the-art techniques traditionally use attentive encoder–decoder architectures. However, due to the large number of parameters in these models, they require large training datasets and long training times. In this paper, we propose ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The Philippine statistician (Quezon City)

سال: 2021

ISSN: ['2094-0343']

DOI: https://doi.org/10.17762/msea.v70i2.2325